Picture for Pengcheng Jiang

Pengcheng Jiang

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Add code
Jun 01, 2026
Viaarxiv icon

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

Add code
May 29, 2026
Viaarxiv icon

Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation

Add code
May 13, 2026
Viaarxiv icon

Learning to Predict Future-Aligned Research Proposals with Language Models

Add code
Mar 28, 2026
Viaarxiv icon

A Multi-objective Evolutionary Algorithm Based on Bi-population with Uniform Sampling for Neural Architecture Search

Add code
Feb 09, 2026
Viaarxiv icon

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs

Add code
Feb 07, 2026
Viaarxiv icon

Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation

Add code
Feb 03, 2026
Viaarxiv icon

Adaptation of Agentic AI

Add code
Dec 22, 2025
Figure 1 for Adaptation of Agentic AI
Figure 2 for Adaptation of Agentic AI
Figure 3 for Adaptation of Agentic AI
Figure 4 for Adaptation of Agentic AI
Viaarxiv icon

GRACE: Generative Representation Learning via Contrastive Policy Optimization

Add code
Oct 06, 2025
Viaarxiv icon

Topic Coverage-based Demonstration Retrieval for In-Context Learning

Add code
Sep 15, 2025
Viaarxiv icon